Psychoacoustically-Driven Multichannel Audio Coding
نویسنده
چکیده
Multichannel audio rendering allows for the immersion of a listener into a rich acoustic scene, as compared to traditional stereophonic methods. Nevertheless, the excessive transmission and storage requirements of multichannel audio pose a considerable obstacle towards its widespread usage. A novel method is presented here with which a single channel of a multichannel signal is conventionally transmitted and stored while the remaining channels are resynthesized based on statistical conversion of the same, single channel-signal. The size of the parameters required for the resynthesis process is much smaller than the size of the original channel-signal for the same resulting audio quality. Mathematics Subject Classification: 68P30
منابع مشابه
Binaural cue coding-Part I: psychoacoustic fundamentals and design principles
Binaural Cue Coding (BCC) is a method for multichannel spatial rendering based on one down-mixed audio channel and BCC side information. The BCC side information has a low data rate and it is derived from the multichannel encoder input signal. A natural application of BCC is multichannel audio data rate reduction since only a single down-mixed audio channel needs to be transmitted. An alternati...
متن کاملTutorial for ISIMP-2001 Recent Developments in Advanced Audio Processing
When DVD and home theater systems become more popular these days, high fidelity multichannel (5.1 channel or 10.2 channel) audio systems are well received in the market. Compared with the traditional mono or stereo audio, multichannel audio requires a much more efficient coding scheme for its storage and transmission. This talk will present two new multichannel audio coding techniques: (i) the ...
متن کاملParametric Coding of Stereo Audio Based on Principal Component Analysis
Low bit rate parametric coding of multichannel audio is mainly based on Binaural Cue Coding (BCC). Another multichannel audio processing method called upmix can also be used to deliver multichannel audio, typically 5.1 signals, at low data rates. More precisely, we focus on existing upmix method based on Principal Component Analysis (PCA). This PCA-based upmix method aims at blindly create a re...
متن کاملMulti-channel psychoacoustically motivated speech enhancement
Multichannel techniques offer advantages in noise reduction and overall output signal quality when compared to the well studied mono approaches. In this paper we present an original multichannel psychoacoustically motivated noise reduction algorithm that naturally extends the single channel psychoacoustic masking filter previously studied in the literature [1]. The optimality criterion is desig...
متن کاملHigh-Fidelity Multichannel Audio Coding Second Edition
Preface Audio is one of the fundamental elements in multimedia signals. Audio signal processing has attracted attention from researchers and engineers for several decades. By exploiting unique features of audio signals and common features of all multi-media signals, researchers and engineers have been able to develop more efficient technologies to compress audio data. Although books on digital ...
متن کامل